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ABSTRACT. In this paper, we derive upper bounds for the heat kernel of the simple ran- 
dom walk on the infinite cluster of a supercritical long range percolation process. For any 
d > 1 and for any exponent s 6 (d, (d + 2) A 2d) giving the rate of decay of the perco- 
lation process, we show that the return probability decays like t~ d ' s - d up to logarithmic 
corrections, where t denotes the time the walk is run. Our methods also yield generalized 
bounds on the spectral gap of the dynamics and on the diameter of the largest component 
in a box. Besides its intrinsic interest, the main result is needed for a companion paper 
studying the scaling limit of simple random walk on the infinite cluster. 

1. Introduction 

Random Walks in Random Environments (RWRE) are a major topic of research in 
probability theory, with important connections to topics such as Anderson Localization 
and Interface measures from mathematical physics and the study of random matrices 
through the RWRE transition kernel. One simple and fundamental setting for RWRE 
occurs from random bond dilution of an ambient graph G via an i.i.d. percolation pro- 
cess. Among the basic questions is: to what extent such a process affects the macroscopic 
characteristics of the associated Simple Random Walk (SRW)? 

In the context of nearest neighbor percolation on many authors have contributed 
to the understanding of the behavior of the SRW. Let us mention some of the key papers 
in the area: for various models of RWRE, Kipnis-Varadhan [16] introduced "the environ- 
ment viewed from the particle" point of view to derive annealed functional central limit 
theorems; this work was strengthened in Demasi et al. [13] where it was applied to SRW 
on nearest neighbor percolation clusters; Sidoravicious-Sznitman [25] extended the per- 
colation theorem of [13] to the quenched regime on d > 4; Remy-Mathieu [21], Bar- 
low [1] provide quenched heat kernel bounds on super critical percolation clusters, with 
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earlier estimates obtained by Heicklen and Hoffman [14]; and finally Mathieu-Pianitskii 
[20], Berger-Biskup[6] extend [25] to supercritical clusters on Z d ,d > 2. 

In this paper, we consider a variant of this latter problem - SRW on super critical 
Long Range Percolation clusters on Z rf (LRP). LRP, at least in one dimension, was first 
considered by Schulman in [24] and Zhang et. al. [29]. It is a random graph process on 
Z d where, independently for each pair of vertices x,y G Z d , we attach an edge (x, y) 
with probability p x>y . Throughout we shall assume (though this is not strictly necessary) 
an isotropic model: p x ,y = P( || x — 3/H2) where 

lo g p ( r ) n i\ 
~ — » s as r — » 00 (1.1) 

logr 

for some s G R + . There are a number of transitions for the behavior of this process 
as a function of s, one of which is addressed in this paper. The earliest results of this 
kind were given by Schulman [24] and sharpened to the critical case d = 1, s = 2 by 
Newman-Schulman in [23], see also [19]. 

More recently, the long range model gained interest in the context of "small world 
phenomena", see works such as [22], [28] and [8] for discussions. Benjamini and Berger 
[2] initiated a study of geometric properties of a compactified version of this model, 
focusing on the asymptotics of the diameter on the discrete cycle Z/NZ. Their moti- 
vation regarded connections to modeling the topology of the internet, see also [17] for 
a different perspective. In the compact setting, [2] was followed up in [12] for higher 
dimensional discrete tori and more recently by Benjamini, Berger and Yadin [3]. In the 
last paper, the authors study the mixing time T of SRW on Z/ NZ, providing bounds of 
the form 

cN 5 - 1 <t< CN'- 1 log 3 N (1.2) 
asymptotically almost surely where T is the relaxation time of the SRW. We will make 
use of the method of proof from the revised version of the paper [4] which fixes a gap 
in the published version [3]. We note that all of the above mentioned results for SRW on 
LRP assume that nearest neighbor connections exist with probability 1. 

On Z rf , we shall say that the family of probabilities p Xi y = P(||x — 3/H2) is percolating 
if the process admits an infinite connected component. By translation invariance, this 
property is a — 1 event. The Burton-Keane argument [9] implies that if jP(||x — 3/H2) 
percolates, then the infinite component is unique a.s.. It is worth noting however that in 
our setting, this was already shown in [19]. 

Since the work of [24, 23], most studies of LRP on Z rf have focused on conditions 
for and global aspects of the geometry of percolating clusters. Berger [5] studied the 
properties of transience, recurrence of SRW on LRP. Biskup [8, 7] studied the graph (or 
chemical) distance and diameter of the super-critical process in the regime d < s < 2d 
providing sharp bounds, up to lower order. 

1.1 Statement of Main Result. 

Assume that s G (d,d + 2). Let us begin by noting that the one step annealed transition 
probabilities Pi (0, x) are proportional to po /X and thus are heavy tailed and in the domain 
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of attraction of an a = s — d stable law. Given this, a natural assumption is that the 
quenched SRW process will asymptotically behave like an a-stable Levy motion. 

The heat kernel P^(x,y) should thus be expected to decay point-wisely as £- rf /( s - rf ). 
The main result of our paper is to establish a heat kernel upper bound of this order up 
to logarithmic factors. Note, however, that this intuition fails in the case of d = 1 and 
s G (2,3) where [3] showed that the spectral gap of the dynamics on the torus Z/ (NZ) 
is 0(N 2 ) and so behaves instead like Brownian motion. In a subsequent paper [11] we 
will show that the limiting distribution in this case is Brownian motion. 

Let £z d denote that set of unordered pairs of vertices (x,y) such that x,y G Z d . Let 
Q = {0, l} £ z d denote the sample space for LRP on Z d . For each to G Cl,y G Z d , let 
deg w (y) denote the degree of the vertex y in <x>. Let u denote the product measure on Q 
defined by the probabilities (p x , y ){ x , y ) € s zd - 

Theorem 1 Let us consider s G (d, d + 2) for d > 2 and s G (1,2) for d = 1. Assume, for 
simplicity, that there exists L such that 

p x , y = 1 - e-^-y^ for \\x- y\\ 2 > L (1.3) 

and suppose that the p x ,y are translation invariant and percolating. Then there exists an event 
Oi C Q with u(Q.i) = 1, universal constants C\,S > and a family of random variables 
T x (co) with the property that T x (co) < oo whenever x,y G C°°(a;), such that the following 
holds: 

P?(x,y) < C x deg^{y)t- A l^ \o Z & t. (1.4) 
for t > T x (co) V Ty(cv). Moreover on the event E x := {T x (co) < oo} and for any r/ > 0, there 
exists C(r/) > so that we have 

u(T x > k\E x ) < C{n)k-1 (1.5) 

This follows the work of Barlow [1] to the context of long range percolation. We 
mention that Kumagai and Misumi [18] derived heat kernel estimates for the LRP graph 
with d = 1, s > 2; they behave like the nearest neighbor model. Note that in this case, it 
is essential to assume nearest neighbor connections to have an infinite component. 

It is clear from our proof that the same result holds for any percolating translation 
invariant p(x, y) which satisfies (1.1) for s G (d,d + 2), but for convenience of exposition 
we consider the specific form here. We emphasize that we do not assume that nearest 
neighbor edges are occupied with probability one. This leads to significant technical 
challenges and much of our work goes into controlling complications which arise in this 
setting. For instance we must deal with the fact that some portions of the giant compo- 
nent are only connected via very long edges. Our results also provide a refinement on 
the understanding of the geometry of components of the LRP on Z d , one consequence 
of which is the following: 

Theorem 2 With the same conditions as in Theorem 1, let C 1 (N) denote the largest component 
of the graph induced by restricting the process to the vertices £>n(0) := [—N,N] d and let C 2 (N) 
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(1.6) 



asymptotically almost surely. 

Further, there exists a constant S such that asymptotically almost surely 

Diameter (^(N)) < log 3 N, 
where Diameter refers to the graph diameter of the LRP cluster. Finally, there is e > so that 



where the event {0 <-> B C N (0)} denotes that the cluster containing the origin intersects the 
complement of[—N,N] d . 

These statements are proved, respectively, as Lemmas 2.9, 2.12 and Corollary 2.13 
This Theorem extends results of [2, 12, 7] to the setting where nearest neighbor bonds 
do not occur with probability 1. It should be noted that [7] obtains the correct power 
8 = log^ 1 (2d/s) up to o(l) terms vanishing with N and that all three papers consider 
s G (d,2d), whereas we only explicitly consider s G (d, d + 2 A 2d) (for us this is not 
really a necessary restriction for 2 but presumably is for 1 ). Biskup [8] analyses the case 
without nearest neighbour bonds. He bounds the distance of two typical vertices but 
does not bound the diameter in this setting. 

Theorem 3 With the same conditions as in Theorem 2, let C 1 (N) denote the largest compo- 
nent of the graph induced by the vertices [— N, N] d . Then there exists a constant 8 such that 
asymptotically almost surely 



Simple conductance arguments (e.g. [3]) show that this is tight up to logarithmic 
factors. This extends the results of [3] in two directions: firstly to higher dimensions and 
secondly and more significantly to the setting of LRP where nearest neighbour bonds 
are not included with probability 1. 

A main motivation for this work is to provide an important estimate for our compan- 
ion paper [11] where we determine the scaling limits for random walks on supercritical 
LRP clusters s G [d, d + 1) for d > 1. The scaling limit is a Levy process with exponent 
a = s — d which confirms a conjecture of [6]. As mentioned above, in d = 1 we also 
prove convergence to Brownian motion for s > 2 as well. 

It remains open to remove the logarithmic factors in the upper bound of Theorem 1 
which would be necessary to obtain sharp asymptotics analogous to [1]. Our approach, 
following [3], of using the multi-commodity flows to bound the spectral gap of the SRW 
seems to inherently lose logarithmic factors. 

1.2 Proof Outline. The goal is to adapt, in so far as it is possible, the methods for ob- 
taining heat kernel upper bounds employed in [1], particularly Section 3.1. This causes 




(1.7) 



Gap(C l (N)) > N d - s \og s N 
where Gap denotes the spectral gap of the SRW on C 1 (N). 
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several complications: since the underlying percolation process is, in a sense, scale in- 
variant, it is not at all clear how to choose balls on which we may obtain good spectral 
gap bounds. Moreover, since the volume growth of the largest cluster is exponential in 
graph distance (see [8, 27]) none of the standard methods apply to our setting. 

Our solution to these problems are twofold. We study SRW on the graph G, defined 
as the largest percolation cluster inside a ball of £°° radius N about the origin B N (0) 
where N is chosen sufficiently large. Note that because of the scale invariance of LRP, 
this choice of N will be typically much larger than the time scale on which we wish to 
study the walk. While Barlow's argument was made for the particular case of nearest 
neighbour percolation, abstractly it only requires a partitioning of the percolation cluster 
into connected subsets, all of which are "good" in that they possess similar volumes, 
small diameters, and good spectral gap bounds. His proof is inherently multi-scale and 
thus one must adjust the choice of partition on each time scale. 

For each time s we construct a partition V s of the largest component of the large ball 
£>iv(0) into disjoint connected subsets which all have similar volumes, small diameters, 
and "good" spectral gap bounds. As some vertices are only connected to the giant com- 
ponent through very long connections to distant parts of the graph this must be done 
carefully and can not involve only the local considerations. We then adapt the argu- 
ments of Barlow replacing his "good balls" by elements of the partition V s to obtain the 
stated heat kernel decay. 

The issue becomes how to construct V s (and this represents the bulk of the work). The 
basic idea, which in itself is not sufficient, is to consider a re-normalized lattice, tiling 
£>]v(0) by blocks of side-length log 7 N for some appropriately chosen 7. These blocks 
form the lattice points of the re-normalized graph. We proceed by revealing edges in 
a number of stages. At each stage, the remaining unrevealed edges are independent of 
one another and of the construction to that point. 

For the first stage, denote any "microscopic" block by A. We reveal only the edges be- 
tween vertices within A and denote the largest component by C* (A). Biskup [8] proved 
that with exponentially high probability, the size of C* (A) is at least p log** 7 N and we 
say the first step succeeds this estimate holds for all blocks. 

The second stage is conditional on the success of the first stage: we reveal all edges 
between vertices within these largest components C*(A). On the event that the first 
stage succeeds for two neighbouring blocks A, A*, the probability that they are directly 
connected is bounded below by 

P(C*(A) <-> C*(A*)) > 1 - e~^ {2d ' shN . (1.8) 

Thus we may guarantee all blocks are directly connected to their neighbors with high 
probability. We say the second stage succeeds if the largest components of all neighbour- 
ing blocks are connected. This cluster of vertices forms what we refer to as the core for 
the maximal connected cluster of 5^(0). 

On the event that the first two stages are successful, the core defines a re-normalized 
lattice isomorphic to B N/log r N (0). Moreover revealing the remaining edges of the core 
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of B N (0) naturally induces a percolation process on the re-normalized graph, which 
dominates LRP with all nearest neighbor connections present. Furthermore, the effective 
value of j6 for the induced LRP process increases from ft to f> log^ 2li_ ^ 7 N which results in 
a very dense well connected graph. 

If we were studying the SRW on the graph induced by the core, we could apply the 
methods of [4] by using the components C*(A) as building blocks. However, at this 
point we are confronted with another problem: because we do not assume nearest neigh- 
bor connections occur with probability one, after the first two steps there are many small 
clusters left over which are nonetheless part of the largest component, possibly through 
long connections. The main hurdle to overcome is how to allocate small clusters to parts 
of the core so that our diameter, volume and spectral gap bounds still hold. This is done 
through a multistep clustering scheme, which we discuss in detail below. Again we 
reveal edges in stages and take advantage of the independence in LRR 

The rest of our paper is organized roughly in reverse order of this discussion: In the 
next section we study the construction of the core vertices and then demonstrate how to 
allocate small clusters to the core so as to maintain regularity properties. Then we prove 
spectral gap bounds for SRW on these allocated clusters. In Section 3 we adapt these 
probabilistic estimates to obtain the desired heat kernel bounds. 



2. Graph structure 

The construction of the long range percolation measure is standard, but we recapitu- 
late some aspects here to collect notation. We begin by introducing = [— N, N] d n Z d . 
Given a subset A C Z rf , let £ a denote the set of unordered pairs (x, y) such that one of 
x,y £ A. If A = Z d , let £ = £ Z d. An edge b G £ is said to be occupied (relative to 
to G O) if an, = 1. By slight abuse, we shall regard a; as a random graph with vertices 
Z d and edges given by {b : coj, = 1}. 

Recall that, given a connected finite graph G = (V, £ ), SRW on G defines a transition 
kernel P X/V , which as an operator is self adjoint with respect to £ 2 (n) where 

7t(x) ocdeg G (x) (2.1) 

is the stationary probability measure for P. The spectral gap is defined as 

For a connected subset C C w, it will at times be useful to consider the subgraph 
induced by C from to and the SRW on this subgraph. We will denote the transition 
kernels for these walks by 

Pf (x, y) and, respectively P£ f (x, y) (2.3) 

as necessary. 

Throughout, we work with the metric on Z d and various intrinisic graph metrics 
on percolation clusters C. We will generally be careful to note which of these metrics we 
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are using by either an explicit statement or by the notation 

d z *,oo(;-) dc(;-) (2-4) 

respectively. Note that for any Z d , d Z d /CO (-,-) is comparable to d Z d /2 {■,■)> the usual 
Euclidean metric. Let B N (0) be the £°° ball about or radius N. Given N\ < N, 
let G = G(N,Ni) denote a fixed minimal cover of B N (0) by boxes A Nl of the form 
= x + £> Nl / 2 (0) so that | | = N( . Note that we implicitly assumed N\ is even and 
also note also that if Ni does not divide N, this cover may not be unique; neither will be 
of any loss and we fix some deterministic rule for choosing a cover. Let Vn x denote the 
collection of vertex disjoint boxes A Nl whose union is the cover G. 

Theorem 2.1 Suppose that p XiV satisfy the conditions surrounding (1.3). Suppose that d < 
s < (d + 2) A 2d. Then there exist universal constants pi,pi, Q, C 2 , c, C, 5\, 5 2 > so that if 
N > N\ and if G is a minimal cover ofB^(0) by boxes of side-length Ni then there is an event 
A{N,N x )with 

p(A(N,N 1 )) > 1 - C x e- C ^ 2N \ (2.5) 
and on A (N, N\),we can find a partition V of G into connected subsets ifS (0, A Nl )) An G -p N so 
that: 

(1) yoi((f(o,A Nl ))nA Nl ) >piN(. 

(2) Vol (9 (0,A Nl )) <p 2 Nf. 

(3) Diam(& '(0,A Nl ) < clog^(Ni). 

(4) G«^ (0 Ani) > CNf- s log"' 52 NforallA Nl G V Nv 

where the diameter is in the graph distance on the induced components. 
Let us define t x ■ 00 to be the usually shift of the conifiguration co: 

T x ■ w {y,z) = w {x+y,x+z) ( 2 -6) 

Let r x ■ A(N, N\) denote the image of A(N, Ni) under the shift r x : 

r x ■ A(N,Ni) = {co:r x -wG A(N,Ni)}. (2.7) 
Let us define B(e, N, x) = P\ N i-e >Nl>N eT x ■ A(N, N\). Then it is easy to see: 

Corollary 2.2 For all x G Z d and all e > 0, there exists an integer valued random variable 
N e ,x(w), N £/X (co) < 00 ua.s. such that for all N > N eiX (a>), B(e,N,x) occurs. Moreover the 
tail ofN eiX (co) satisfies 

u(N £/X (co) > k) < e- C3l °z 2k (2.8) 

for some universal constant C3. 

The proof of this theorem is a bit involved, taking the better part of the next 15 pages. 
We already outlined the issues encountered in Section 1.2. We begin by building, with 
high probability, a core of vertices, which may be partitioned into connected subsets of 
G(N, N-[) which have good diameter and volume growth in their own right. The second 
step is to argue that we may allocate vertices not in the core to connected subsets of 
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the core in such a way that the diameters and volumes do not change in appreciably. 
By construction we will be able to obtain spectral gap on the partition sets by using a 
multi-commodity flow argument [26] and comparison to the corresponding result for 
the Erdos-Renyi graph with connection probability in the connected regime. 

2.1 Step 1: The Core. Let us denote by C*(A L ) the largest component of A L connected 
inside A^. Let us recall a crucial result from [8]: 

Theorem 2.3 Let d > 1 and consider the probabilities (pxy) X yez d sucn (1-3) holds for 
some s G (d,2d). Suppose that (p X y) x ,yez d are percolating. For each s' G (s,2d) there exist 
numbers p > and Lo < oo such that for each L > Lq, 

p(\C* (A L )\ < p\A L \) < e-P^' . (2.9) 

In particular, once L is sufficiently large, the largest connected component in typi- 
cally contains a positive fraction of all sites in A L . We will only invoke this theorem once. 
Fix, once and for all, s' , p and Lo for which the theorem holds. We choose a set of scales 
on which our construction will take place: With N, Ni fixed, we choose (if possible) 

N = N > Ni > N 2 > N 3 > N 4 (2.10) 

so that 

N 2 = [N[ s ~ d)/d log 3/rf Ni J , N 3 = LN 2 1/2 J and N 4 = [log 27 ^-^ N] . (2.11) 

We remark that, for our objectives, the choice of N 2 is necessary, N3 has a lot of choice, 
and N4 is roughly necessary. For convenience we will suppose Nj divides N,-_i for ; > 2, 
though this is patently false. It is quite standard to modify the proofs to take into account 
this discrepency (for example by working with a dyadic decomposition). 

For any box A^ and for any integer K dividing L, let Pk(A^) denote the partition 
of Ai into boxes of side length K. Let (Vjsij)i<j<A denote a sequence of refinements of 

defined as follows. Let = {A Nl : A Nl C Inductively, having chosen the 
refinement "Pn ; to consist of boxes with side length Nj, let An ; G Pnj- As above we may 
introduce pN j+1 (-^N,)/ a partition of An ; into blocks of side-length Ny + i and then let 

V Nj+1 :=U AN . €VN .V Nj+1 (A Nj ) (2.12) 

Definition 2.4 We shall say that two blocks A lf A 2 G V^ j+l are adjacent ifd^d iCO (Ai, A 2 ) < 
2, where d Z d /Ca is the Hausdorff distance measured with respect to the t° norm on Z d . 

For each A^., we construct a random set ^{A^.) C A^., as follows. We define the 
cores intersection with a block Ajv 4 G as 

#(A N4 )=C*(A N4 ). (2.13) 

and say that A« 4 is occupied if |5f (A^J | > plAjvJ. 

We use interchangeably the terminology that $f (Ajv 4 ) exists, has been constructed or 
that An 4 is occupied. Let denote the c-algebra generated by all edge events co Xi y for 
pairs of vertices x, y G G in the same box in the partition T 3 ^ and let O G J 7 * denote the 
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event that all ^(A N4 ) are occupied. By Theorem 2.3 and a union bound we have that 

u(0) > 1-£>- c1 °s 2n (2.14) 

Let T\ denote the c-algebra 

T ] = a {jF* V <7{w X/V : \x - y\ < Nj G A N; G Vj,x,y G U An4 ca n .^(An 4 )}} 

Let {^(A N4 ) <-> ^(A^ 4 )} denote the event that there is a direct connection from Sf(A N J 
to &(A' N ) and {5f(Ajv 4 ) ^ ^(A^)} denotes the complimentary event. Let A denote 
the event that {^(A^J <-> W(A' Ni )} occurs for each pair Ajv 4 , A^ 4 of adjacent blocks. 

Definition 2.5 Let us define the core to be the graph ^ = ^ with vertex set U a N4 <=v Ni & (^n 4 ) 
and edges given by revealing all edges of£°° length at most N2 between these vertices. Similarly, 
sampling edges inside Ua n ca n .^(An 4 ) of length at most N2 A Nj, call the derived graph 

Lemma 2.6 There exist constants 8\,C\,C2 > such that for all N > N\ > N2 > N3 > N4 
chosen as in (2.10), there exists an event 

Co{N, Ni) = Co{N, N lt N 2 , N 3 , N 4 ) (2.15) 

with 

fi(Co(N,Ni)) > 1 - c 1 e~ C2lo ^ N (2.16) 
satisfying the following properties: 

• Connectedness: &(A N .) is connected in A Nj . Ifd Z i (A Ni ) (A' N J) < N4 then 
t ^(A^ 4 ),W(A' Ni ) are directly connected. 

• Volume Growth: |Sf(Ajv.)| >p|A^.|. 

• Diameter Bounds: Diam (&(A Nj )) < N%log Sl Nj for all A Nj for j = 2,3. 
Also, we have 

• Localization: Sf( A^ .) C A^.. 

• Measur ability: Sf(Aw.) is measurable with respect to the a-algebra J-~2vj- In particular 
the construction of£ does not call on edges longer than N2. 

We remark that the diameter bounds here are not optimal. Bounds of the correct order 
have been obtained in [8]. By our choice of the Nj, we could have absorbed N| into 
log 1 Nj, but we keep in this form to track the dependence of estimates on our choices. 

The key observation is that if we regard {^(A]v 4 ) : A^ 4 C A^.} as vertices of a course 
grained lattice in a box B Nj/Ni , then on the event O n A, sampling all edges between 
all remaining vertices of {^(A N4 ) : Ajv 4 C A Nj } stochastically dominates long range 
percolation on a box of side length Nj/N, with all nearest neighbor edges present. Indeed, 
if 

d z * /O0 (&(A N4 ),&(A' N4 )) < IN, (2.17) 
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then 

K^(An 4 ) - ^(A'mJ I O) < e-* d, V(^)-^t s (2.18) 
First we note that with a union bound this bound implies that 

fi(A\0) > l- e - cl °s 2N (2.19) 

Further on O D A, the re-normalized block percolation process determined by the edges 
between these W(A^ i ) stochastically dominates long range percolation with exponent 
s and B = B^ 4 = N 2d ~ s where nearest neighbor blocks are conditioned to be connected with 
probability one. Note that by our definition of N4, this leads to a highly connected graph. 

Note that &(A Nj ) C for all A Nj G V Nj ,j = 1,2,3,4. Before giving the proof (most of 
which is clear by construction) we record a diameter bound separately for easy reference. 

Lemma 2.7 Consider the family (^(Nj) AN on the event O DA. Then 

p(3A Nj : Diam (#(A N .)) > log 15 NyNf some A Nj \0 n A) < 2 d N d e- N ^ s (2.20) 
forj = 2,3. 

Proof. By (2.18), we may apply: 

Corollary 2.8 (Corollary 5.1 from [12]) Let Bk be a box of side length K. and consider long 
range percolation on Bk with exponent s and parameter & and nearest neighbors connected a.s. 
For any C > there exists 5 such that 

p(Diam (B K ) > log s K) < e -^s c ^). (2.21) 

We amplify this estimate via the observation from (2.18) that the course grained pro- 
cess with vertices (f£ (A Ni )) An cAn dominates a long range percolation process, denoted 

by CK, with exponent s and B Ni = Bp 2 N^f' s . 

The point is that we can instead view CK as a union of [p 2 Nl d ~ S J independent iden- 

tically distributed long range processes (CTZj)^ 4 with exponent s and parameter 
B' > B. By slight abuse of notation, we let p denote the coupling measure between the 

process on (Sf (A N4 ))a N4 cA n and the family (CTZj)f =1 4 . 

Thus applying the Corollary to each independent copy, we have 

fi($j : Diam (Olj) < log* 5 NjN% some A N; |C n A) < e - Lp 2 Nf- s J e(log c N ; ) (2 22) 

Taking a union bound finishes the proof. □ 

Proof of Lemma 2.6. The volume growth, connectedness, localization and measurability 
claims all follow by construction, on the event that O D A. By equations (2.14) and (2.19) 
this event holds with probability at least 1 — e^ clo ^ N . The diameter bound holds by 
applying Lemma 2.7 and taking a union bound. 

□ 
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2.2 Step 2: Allocating Small Clusters to the Core. 

Recall that we are aiming to show that we can, with high probability, partition the 
largest cluster of a minimal cover G of Bjv(O) into small subsets, each of which has sim- 
ilar volume and spectral gap. This has already been partially achieved for the core ( S. 
Significantly for our future bounds on spectral gaps, was constructed without knowl- 
edge of edges longer than Afc. 

This core is quite large, and the next lemma shows (among other things) that with high 
probability it is in the largest cluster. Our partition, however, must include all vertices 
in the largest cluster, not just those in the core. We proceed to allocate the remaining 
vertices not in the core to the subsets ^(A^) so as to not alter the volume, diameter 
and (therefore) spectral gaps of the ^(A^) already constructed by too much (Note: we 
have not yet addressed the spectral gaps ^(AjvJ, we will directly analyze the gap of 
our full partition). The following lemma justifies our analysis of the core, showing that 
with high probability it is part of the largest component, and furher serves as a model 
for future development: 

Lemma 2.9 Let ti\ > n.2 > ■ ■ ■ > n m enumerate the cluster sizes inside G (having sampled 
all internal edges). Then there exists C3, C4, 62 > such that Then 

u(n 2 > N s f d log 2 N) < c 3 e- c * l °z 2N . (2.23) 

Moreover, the core & is a subset of the largest component except with probability 0(e^ Cilo ^ N ). 

Proof. On the event O, we next reveal the edges connecting all vertices in ^ c D G and 
denote the tJ-algebra generated by these edges as B. The set ^ c n G is broken into a 
sequence of clusters 

C\C Z ,...,C M C& c r)G (2.24) 
where the edges between and (C 1 , . . . C M ) have not yet been revealed unless they were 
revealed in T*, that is pairs of vertices which are in the same box in V Ni . On O the core 
has positive density down to the level N4. Thus 

u(C ^ <S\B,T+, O) < e-Wf-lcl (2 25) 

since each for vertex x G C there are at least pNf vertices in the core in an adjacent 
block in 7^4 and x could be connected to them each independently with probability at 
least 1 — e~ 2 ^ N 4 S l c 'l. As the total number of clusters is bounded by 2 d N d , by a union 
bound except with probability 2 d N d e~ 2 P N 4 s l c 'l there is no component of size greater than 
Nl~ d log 2 N which is not connected to the core. 

On the event Of) A, which occurs with probability at least 1 — e~ clo 8 2 the core forms 
a single component of size at least pN . Hence the core is in the largest component 
and the second largest component is of size at most N^ d log 2 N except with probability 
C3e -c 4 log 2 N as re q U i re( j_ 

□ 
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To get around issues outlined above, we will reveal edges with endpoints in <$ c n 6 
in stages. Let us introduce notation to facilitate this step. Let Hnj denote the filtration 
defined by 

H Nj = o-{T 2 V cr{a) Xi y : \x - y\oo < Nj,x,y G G}} ■ 
that is T2 along with all edges in G of l°° length less than Nj (only a nontrivial construc- 
tion if = 3, 4. Let SC-^. denote the graph induced by & and sampling all edges in 6 of 
length at most Nj (thus SE^. is measurable with respect to W^). 

Let C(j,&) denote the connected component of in this graph. For x G C(j,W)\W 
we define an allocation <pj(x) (formally a function from C(j,&) to Vn 2 )- Intuitively we 
would like 

cPj(x) = A N2 if d Xli . (x, & n A Nl ) = d# Nj (x, ) (2.26) 

however this definition has two problems: First, there may be ties and so strictly speak- 
ing the intuitive allocation is not well defined; Second, it is important that (pj 1 (Ajv 2 ) is a 
connected subset of the maximal component. 

Thus, to define <pj on C(j,&) we proceed inductively. Consider the sets 

R k (G) = {xe C(j,<$) : d Xiij {x,Q) < fc}. (2.27) 

Here d^ N (-,-) denotes the ^ -graph distance. Now let A\(A^ 2 ) = {x G Ri(G) '■ 
d^ N (x,Q fl Ajv 2 ) < 1} with ties broken according to some deterministic rule. Induc- 
tively, suppose that A k (A^ 2 ) have been defined and let 

A k+ i(A N2 ) = {x G R k+1 (G)\R k (G) : d^ N .(x,A k (A N2 )) < 1} (2.28) 

again with ties broken according to some deterministic rule. Finally, let 

(Pj(x) := A N2 if x G U k A k (A N2 ). (2.29) 

Then if 

A N2 ) = (?n A N2 ) U {x : <Pj(x) = A N J, (2.30) 
A N2 ))a N2 c^ defines a partition of C(j,&) into connected subsets. 

2.2.2 Allocation Phase 1. We shall need a simple bound on the maximal degree of a vertex 
in B K (0). Let deg w (x) = £ yGZrf l Wxy=l and Deg(A K ) = max ieAj( degf(x). 

Lemma 2.10 Then V<5 > 0, 3C(<S) > such that for all t > 0, we have 

u{Deg(A K ) > MogK) < C(6)K d e- SHo Z K . (2.31) 

Proof. Using the exponential Markov inequality and independence of the summands of 

deg w (0), 

p(<f(0) > logJC) < e-^^^^C)]. (2.32) 

But 

logE,[e^(°)]<C L + £ - l)||x||- s < C^(^) + - l)L d - s . (2.33) 

||x||>L 

Taking a union bound finishes the proof. □ 
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Lemma 2.11 There exists pi,p2/ c 5/ c 6/^3 > such that for any N > Ni > N 2 > N3 > N 4 
as in (2.10) the following holds: Consider the graph 3En y We can find an event E C Co(N, N\) 
with E € Hn 3 such that if (& (3, Afj 2 )) denotes the partition ofC(3,&) into connected subsets 
from (2.30) then on E 

Diam(9(3,A N2 )) < C 2 log s N 2 + N^lo^N (2.34) 

and 

Pl Ni < Vol(W(3, A N2 ) ) < piNi (2.35) 
for all Ajv 2 . Also, for any component C C JTn 3 so that C 7^ C(3,Sf), 

Vol(C) < Nl~ d log 2 N. (2.36) 

Finally 

KE|^ 4 )l Co(N , Nl) > 1 - c 5 e-^ N l Co{N/Ni) (2.37) 

^ fl.S. 

Proof. Let Un 4 denote the conditional measure induced by u(-\Co(N , Ni) ,Hn 4 ) ■ Con- 
sider the law of JTn 3 under uni and recall the definition of &(3, A^ 2 ) (2.30). Let Bg(x) 
denote the ball of x with radius £ measured in the graph distance on JTn 3 . Let L(x) denote 
the least £ such that \B({x) \ > N| log N. Then the same argument as in Lemma 2.9 

Fn 4 (B l(x) (x) = 0,L(x) < 00) < e - cl0 § 2N (2.38) 

^ a.s. and so 

u Ni {3x such that B i(je) (x) = 0,L(x) < 00) < 2^^^'°^ (2.39) 

u a.s. Thus with high probability, N3-clusters which exceed log 2 N in volume are 
directly connected to the core. Since L(x) is bounded by N4 log 2 N on the event that 
L(x) < oo, we have good diameter bounds for the sets (3, An 2 )- 
By adapting the proof of Lemma 2.10 with probability at least 

l-ce- lo s 2N (2.40) 

no vertex has degree exceeding log 2 N after we sample all edges with length no more 
than N3. Thus, 

\B L{x) (x)\ < log 2 N||B LW _ 1 (x)| < Nr d log 4 N. (2.41) 

Let 

B c = {3x such that B L{x) (x) n & = 0, L(x) < 00} (2.42) 
Then on H,if x e ^(3,A Nz ) 

d zd/CO (x,A N2 ) < N 3 L(x) < N 3 Nr ii log 2 (N) (2.43) 

thus 

pNi < W(3,A N2 ) < N| + 2d2^ 1 N^ 1 (N 3 N^ li log 2 (N)). (2.44) 

Combining the estimates gives the result. □ 
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2.2.2 Allocation Phase 2. Now we will work under the conditional probability measure 
determined by the u-algebra Hn 3 and the event E from Lemma 2.11. We must account 
for the small clusters of S^n 3 which connect to C(3,£f) after revealing the remaining 
edges between C(3,Sf ) and C S C fl G. Let C\, . . . ,C m enumerate the small clusters of SC-$ y 
As was observed above, on E (which is Hn 3 measurable), the small clusters C\, . . . C m 
of JTn 3 have size at most N| -d log 4 N. Further, the only way to connect these clusters 
together (or, for that matter, to connect them to C(3,£f)) is via revealing bonds of P° 
length at least N3. These two conditions prepare the way to dominate cluster sizes by 
subcritcal branching processes. 

Let 3C denote the graph obtained by adding all edges in G x (G fl <3 C ) to the graph 

JTn 3 - Let Cj ^> C(3,Sf) denote the event that Cj is connected to C(3,Sf) after all edges 
outside §f have been sampled. Consider the set V := U x Cj. We may define a 

secondary allocation xp :T> — » using a mechanism entirely analogous to that leading 
to Definition (2.30). Let 

W(0,A N2 ) = ^(3,A N2 ) U {x G V : t^(x) = A N J. (2.45) 

Note that even without sampling the remaining edges of & , (^(0,An 2 ))a N2 c^ defines 
a partition of the largest component of G. Thus it does not depend on edges between 
^(An 2 )a N2 c^ with length larger than N 2 . 
We aim to prove the following: 

Lemma 2.12 There exist universal constants C7, Cs, £4, P3, ,04 > so that for all N > N\ > 
N2 > N3 > N4 chosen as in (2.10), there exists an event FcO with 

u (F I E, H N3 ) > 1 - c 7 e- c « lo s 2 N (2.46) 

on if/n'c/z the following holds: For all A Nl G V^ 2 , 

Diam (0, A Nl ) ) < N^ rf log 3 N + Dwm (3, A Nl ) ) (2.47) 

and 

p 3 N$ < |^(0, A Nz ) I < p 4 N(. (2.48) 

Also, ^(Ajv 2 ) C @(0, An 2 ) and ^ (0, A^ 2 ) form a partition of the largest component of G 
into connected subsets. 

Proof In this proof we work under the conditional measure u{- \Hn 3 , E). Denote the set 
of connected components in JTn 3 disjoint from the core as S. We now reveal all edges 
not observed in Hn 3 between clusters in S. Note that all of these edges must be of length 
greater than N3. For each component C G S let 

deg(C) =#{{x,y) :xeC,ye C, \\x - y\\oo > N 3 ,cv {Xry) = 1 for some C G S}. (2.49) 

Then for each C G S 

E,(deg(C) = k\H N3 ,E) < ^Nt 5)k - (2-50) 
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Consider the course grained graph CG$ = {Vs,£s) obtained after revealing all such 
long edges between the Cj £ S and then contracting the components of S to points. Let 
C(Cj) denote the connected component of the contraction of C, in this graph. 

Since on E we have a uniform upper bound on {\Cj\ :Cj € S} of N|" d log 2 N, for each 
i, we can dominate DiamC(Ci) using a subcritical branching process with mean birth 

r] = N s - d log 2 (N) N$~ s (2.51) 

If X n denotes a branching process with this mean tj, we couple X n to C(C;) so that births 
occur whenever we see a new (contraction of) Cjt, exploring C (C, ) starting from Cj. As 

P(X„ > 0) < tj n . (2.52) 

we easily conclude that 

}i(3Ci : Diam(C(C;)) > logN\E,H N3 ) < N d r] lo % N . (2.53) 

Again, since the size of each individual cluster C, is bounded by N^ d log 2 N on E, 
there exist constants C\,ci > so that we have, with conditional probability at least 

Diam(U /6(i(c . ) C / ) < N s f d log 3 N (2.54) 

for all C; G 5. Let 

F x := {Diam(C(C;)) < log NVC,- G 5}. (2.55) 
At this point we conclude that no matter how the cluster ^j e c(C )^j cormec ts toSf (3, A^), 

Diam(^(0,A N2 )) < N s f d log 3 N + Diam(^(3, A Nz )) (2.56) 

on Fi n E. 

Next we consider volume growth. We can bound |C(C,-) | by the total number of off- 
spring T of the process (X n ) ne j<r. We have 

oo 

\C(d)\ <1 + T = 1+£X„ (2.57) 

n=\ 

By (2.50) we have that 

E[e AXl ] < e e Y (2.58) 
Thus if (Y(z')) denotes an i.i.d. sequence distributed as Xi, then 

E[e AX "] = E^iV 1 Y (0] < E[e x »-^i] (2.59) 

Iterating the bound using the natural filtration for (X„)~ =1 , the monotone convergence 
theorem implies 

E[e AT ] < (2.60) 
where /(A, 7/) is the minimal solution to the equation 

/(A / J7)e~ /(A ' ,) =A (2.61) 
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provided rj is sufficiently small (depending on A). Therefore, for any A > 0, there is 
77 = 77(A) such that if N^^log^N)!^" 5 < rj, then 

E [ e A|c(c,)|| E/WN j < e /(M) (2 _ 62) 

so that we may conclude 

: \C(Ci)\ > log 2 N\E,H N3 ] < N d e- M °s 2N ef^\ (2.63) 
Therefore, we can find constants c~ 3 , c~4 so that with conditional probability at least 1 

c 3 e 



-C4 



log 2 N 



yo/(U /Ge(Ci) C ; ) < Nt rf log 4 N VC< e 5. (2.64) 
The only thing left to do is make sure we don't allocate too many clusters ^j e C{C-)^i to 
the same connected subset (3, A Nl ) . According to our work in Phase 1 of the allocation, 
on E we have 

piNi < \&(3, An 2 ) I < pi^i (2.65) 
We have not yet revealed the long edges between V and (3, A Nz ) for any Ajv 2 . This is 
the next step. Let 

A^( 3 ,a N2 ) E E a H,, y ri}- ( 2 - 66 ) 

JG^(3,A N2 ) ||y-x|| 00 >N 3 

Then A/"^( 3;An ) is the sum of conditionally independent random variables and in fact 
is conditionally independent of the connectivity of U- 6( *^Cy. If we separate the sum- 

mands via length, then for each r £ [N" 3 , N] Pi Z, we have at most r P2N2 independent, 
approximately identically distributed summands of distribution type Ber(p = r~ s ). To 
finish the argument, we use large deviations to bound the number of long edges ema- 
nating from any set (3, A Nz ). Let 



Var Na (N 9{3A )) =E f , 



^(3,An 2 ) ~ E h[ A ^(3An 2 )I E '^N 3 ]) |£/^N 3 ^ (2.67) 

denote the conditional variance of N^( 3An ). Then the Azuma-Hoeffding inequality im- 
plies that for any A, t > 

f(l%(3,A N2 ) -E f( [% (3/A]V2) |£,W N3 ]|/^y«r N 3(% (3 , A]V2) ) > N£\E,H N3 ) 

< e -m//2 (26g) 

Taking a union bound and optimizing over t, 



fi(3A N2 : |N^ (3/An2 ) -E^[N^ ( 3 /An2) |£,H N 3]|/ JVar^N^^) > N£\E,H N3 



N\ d _ N 2A /2 



< ( __ ) e -iv r/ ,_ (269) 



Since 

rrf -\jd — S 



E F [%(3,a N2 ) |E, W N3 ] < dN 2 *N 3 *- s (2.70) 
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and 

Var N3 (N n3 , AN2) ) < C 2 N d 2 Nt s (2.71) 
choosing N 2 A = N d/2 N^ s)/2 gives 

¥ (3A N2 :N n3 , AN2) >2C 1 VC 2 NiNt s \E,H N3 ) < (j^ e~^ /2 . (2.72) 
Putting (2.72) together with (2.53), we find 

u(3A Nl : Vol(&(0,A N2 )) > p 2 N^ +2Q V C2N2 3 Nl~ d log 4 N\E,Hn 3 ) 

d 



<(Jp) e - N 2 N t s + N d e- xl °S lN ef^ (2.73) 



□ 

Before moving to the the derivation of heat kernel bounds, let us pause to record the 
last statement of Theorem 2 as a corollary to the previous Lemma: 

Corollary 2.13 IfC 1 (N) denotes the largest component in Bn(0) := [-N, N] d , there is e > 
so that 

u(o^ B C N (0)\0 £ C X (N)) < CN- £ . (2.74) 

Proof. This is a simple application of the branching process argument exploited in the 
previous lemma. The proof is omitted. □ 

2.3 Step 3: Assembling the Estimates. Next we derive the required spectral gap bounds. 
Let H.+ denote the c-algebra generated by the construction so far, that is T^ 2 , plus all 
edges between vertices in ( S C n 6 - leaving out only edges inside the core of length 
greater than Afc. Let us sum up our construction to this point: There exist universal 
constants C9,c w , 5^,ps, pe > so that if (Ni)f =0 are chosen as (2.10) then we can find an 
event R with u(R) > 1 — c 9 e~ Cwlo s N , measurable with respect to H + and so that on R: 

(1) <S exists and satisfies the properties of Lemma 2.6. 

(2) The connected components of & are measurable with respect to Ti + . In other 
words, adding the states of edges which have not yet been revealed does not 
change the connectivity. 

(3) On R the small components of C have size at most N%~ d log 4 N. 

(4) The largest component Jl of 6 may be partitioned into connected random sub- 
sets (^(0,An 2 ))a N2 £P N2 so that 

• Diam(^(0, A Nz ) < log 1 * 5 N 

• p 5 N d < |^(0, A Nz ) I <peNl 
. Sf(A N J C^(0,A N2 ) 

(5) Finally edges of length at least N2 between vertices in have not been sampled 
and are thus conditionally independent of R. 
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At this point we are in a position to apply the strategy of [4] to obtain the bounds on 
gaps of individual sets associated with the which tile G. The idea is as follows: for 
each Ajv 17 consider {Sf (0, An 2 ) : A« 2 C A^}. We now sample the edges between the 
corresponding (0, A^ ) . Define 

&(0,A Nl ) := U An2 ca Ni ^(0,A N2 ) (2.75) 

along with these newly sampled edges. We will bound the spectral gap of (0, A^J 
using multi-commodity flows [26]. 

A Brief Primer on Multicommodity Flows (exposition taken from [4]): Let P be the 
transition matrix of a reversible Markov chain, with stationary distribution re. Let V be 
the set of states of the chain, and let £ be the set of oriented edges; i.e. 

£ = {(x,y) G V x V : P(x,y) > 0} . 

Forx,y G V\etT(x,y) be the set of all simple paths from x to y. LetT = U x ^ yGV T(x,y). 
A flow is a function / :Y — » [0, 1] such that for all x, y G V 

E /(?) = rcfaMy)- 

7€r(x,y) 

The edge Zcwd of an edge e G E is defined as 

/W = E/(7)l7l 

where |7| denotes the number of edges in 7. The congestion of a flow / is defined as 

1 

p(f) = max u\ f(( a > b ))- 
(a,b)e£ Tc(a)P(a,b) J 

Theorem 5' of [26] states that if the eigenvalues of P are 1 > A > A3 > • • • > A„ 
(where n = \ V\), then for any flow /, (1 — A) -1 < p(J). Furthermore, Theorem 8 in [26] 
shows that if P induces an ergodic Markov chain (i.e. if A n > —1), then there exists a 
flow /* such that p(f*) < 16r, where r is the mixing time of the chain. We call /* the 
optimal flow for P. 

The following (among other things) was proved in [15]: 

Theorem 2.14 (Benjamini et. al. Theorem 1.2) There exists C > such that if G is chosen 
according to G(n, p) with p > C log n/n then the mixing time of G{n, p) (which in this regime 
is connected) has a bound tG{n,v) = 0(log n) with probability tending to 1 as N — > 00. 

For us, G will be one of the graphs (0, A Nl ) . If 

de g^( 0/ A Nl )(*) = \iV e Sf(0,A Nl ) : oj {Xty) = 1}| (2.76) 

Then we let 

de g^(0,A Nl )(*) , x 

^(0,A Nl )W = ^ -T-f 7-y (2-77) 
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Of course, this is the stationary measure for SRW on ^(0, A Nl ). Let Gap^ An j denote 

the spectral gap associated to this Markov chain. 

A word about notation below. We will consider various graphs induced by long range 
percolation and random graph processes coupled to these induced graphs. Given such 
a graph G, we will use V(G) to refer to the vertices of G and S(G) will denote the set of 
edges of G. 

Lemma 2.15 (Lower Bound on Spectral Gap for (0, A Nl ) ) There exist universal constants 
c\\, c\2, $6 > so that ifN > N\ > Afc > N 3 > N4 be fixed as in (2.10) then 

u{Gap noANi) > cN*- s / log* N for all A Nl e V\R,H+) > 1 - c u (N / Ni) d e~ Cl2l °^ Nl 

(2.78) 

Proof. Due to the conditional independence built into our construction, if d Z d i00 (An 2 , A' N2 ) < 
kN2 then for each x £ ^(A Nl ) 

u[x^ &(A! N2 )\R,H+) < e -^ k+l ^ SN t s . (2.79) 

By construction, \&(A N2 ) \ > piN{~ d log 71 N\ and so 

|^(A N2 )|(/c + l)- s N 2 ^ s > (N 2 /N 1 ) d = Nl~ 2d log 3 Ni (2.80) 

FixablockAwj £ "Pnj- We may thus compare the block connectivity of (^(An 2 ))a N2 ca Ni 
to an Erdos-Renyi graph G(n, p) of size n = N 2rf ~ s / log 3 Ni with p = N^ 2rf log 3 Ni = 
(log 6 N\)/n. The comparison graph G(n, p) is, by our choice of N2, well in the super 
critical range. 

To increase the probability of our graph having the required properties to 1 — e~ clo 8 N 
we use the same amplification technique that was employed in Lemma 2.7. We may 
view the course grained block percolation process as containing [log 2 M/CJ indepen- 
dent identically distributed samples each distributed as G(n, p') (where C has been cho- 
sen sufficiently large so that Theorem 2.14 holds) and p' = CN^ 2d log N\ = C (log 4 N\)/n. 

Let {ERj)j l °f Nl/c J denote the course-grained i.i.d. copies and let <f^-(A Nl ) denote the 
Erdos Renyi random graph samples. From Theorem 2.14, we immediately conclude that 
except with probability at most e~ L lo s 2 N i/ c J / a t least one of the comparison Erdos-Renyi 
graphs has a mixing time of order 

T<ci 2 logN. (2.81) 

We choose any jo which satisfies (2.81) and define a multicommodity flow on % (0, A^ ) 
as in [4], also described in detail below. Thanks to Sinclair's work [26], for &38j the cou- 
pled Erdos-Renyi graph has on optimal flow f ER with p(f er) < 16c log N\. 

The lower bound on the spectral gap of (0, A^J, is derived by constructing a flow 
supported on (0, A Nl ) using the optimal flow f ER . We follow [4] rather closely. 
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Let 7%( /A j, TCg@. denote the stationary distribution of SRW on the respective graphs 

^(0,A Nl ) /( fi io (A Nl 70 ).Let 

I{% (0, A Nl ) ), T(£M k (A Nl ) ) (2.82) 
be the set of simple paths in (0, A Nl ), ( A Nl ), and let 

r(x, y; % (0, A Nl )), T(i, k; g& h (A Nl ) ) (2.83) 
be the set of simple paths in 

^ (0,A Nl ),^ (A Nl ) (2.84) 

from x toy and i to k, respectively. For a path 7, let 7 + be the starting vertex of 7, and 
let 7~ be the ending vertex of 7 (specifically for edges e = (e + ,e~)). 

For each i G <£^) , let A^ 2 (i) denote the block in ^ (0, A^) coupled to i. For G 
£(&£%j Q ) let e(/,fc) be a specific edge of £ (^ (0, A^J) given by the coupling with x G 
§f (An 2 (0) an d y £ ^(Aw 2 (fc)) (by definition, under our coupling there always exists at 
least one such edge). 

For each pair 1,1/6 5f(0, A^ 2 ) let 7(x, y) be a path in ^(0, Ajv 2 ) that realizes the graph 
distance between x and y in ^(0, An 2 ) (i.e. a geodesic). In case x = y let 7(x, x) be the 
empty path. 

For 77 G T(i,j;&a h (A Nl )), and x G V(S7(0, A N2 (i))),y G V(S7(0, A„ 2 (*))), define 
7(2/, x,y) G r(x,y;^ (0, A^J) by interpolating 77 using the specified edges e(z,/c) and 
geodesies 7(x,y). In other words, if rj = e\ei ■ ■ ■ e\„\, then 

j(t],x,y) = j(x,e+)e(e+ ,e^)j{e^ ,e+)e(e+ ,e 2 ) • • • e(e+ { ,e M )y(e w y). 

Setting A = maxA N2 cA Nl Diam(^(0, A Nz )) we get that \j(r],x,y)\ < (A + l)\r]\. 
As mentioned above, by Theorem 8 of [26], there exists a constant C\ > such that 

V(z,/c) ££{£%{A Nl )) (2.85) 

\£(^ jo (A Nl ))\ £ f ER (vM < 16t(^(A Ni )) < Cl logN. 

7er(^ ;p (A Nl )) 

rj3(i,k) 

We now define the flow / on (0, A Nl ). Let x,y G V(^(0, A^J), and let 2, A: be such 
thatx G Sf(0,A N2 (z)) and y G ^(0,A Nz (jfc)). 

Iff = k route all the flow along 7 (x,y) so/(7(x,y)) = 7% (0 ,A Ni )(x)7% (0 ,A Ni )(y). 

On the other hand, if i 7^ k, then for any 27 G \{i,k;S^^{A^),x G ^(0,Ajv 2 (0) an d 
y G ^(0,A N2 (/c)) set 

and otherwise. 

We bound the congestion of / along some edge (x, y) G £ (S£ (0, A^ ) ) . Let 2, k be such 
thatx G S7(0,An 2 (0) andy G S7(0, A N2 (/c)). 
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Case 1: i ^ k. In this case, any path 7 that contains the edge (x,y), such that f(y) > 
must be of the form 7 = j(rj,z,w) for some?; G 7(<f^ 7o (A]v 1 )) that contains (z,/c). Thus, 

E /(t)ItI 

7 er(<yo,A Nl )) 

73(x,y) 

x V- ^(0,A Nl )(z)^(0,A Nl )(w) 

- E E ^ few — t^t -/er( ? ?)I ? /I( a + 1 )- 

7 er(^ ;o (A Nl ))ze^(o / AN 2 ('7 + )) ^o( An i)^ ) n ^ jo (A Nl ){V ) 

It easily follows that the right hand side is bounded by 

(A + l) ■ (max7r^ 0/ANi) (^(0,A N2 (^)) 2 ) • \£(^ jo (A Nl ))\ 2 ■ £ /erMM- 

(2.86) 

Next, using (2.85), we can bound (2.86) by 

Cl log(N) • (A + l) • (max^ (0/ANi) (^(0,A N2 (£)) 2 ) • \£(^ jo (A Nl ))\. (2.87) 

Case 2: i = k. In this case, any path 7 that contains the edge (x, y), such that f(j) > 0, 
is one of the follwing: Either it is of the form 7 = 7(77, z, w) for some 77 G T(&£8i a (A^ ) ) 
that contains the vertex z, or it is of the form 7 = y(z,w) for some z,w G ^(0, An 2 (i')). 
Any path 7/ G T{${%j a (A^)) that contains the vertex i must contain some edge (i,k) G 
£(£& jo (A Nl )). Thus, using (2.10) and (2.87), 

E /(7)l7l 

7 er(^ (o,A Nl )) 

< E /(7(Z/»))l7(Z/0')| 

+ E EE f{y{t],z,w))\y{i],z,w)\ 

k:(i,k)eS(£@ k (A Nl )) r,eT(£Mj 9 (A Nl )) z€<#(A N2 ( n +)) 
r]B(i,k) we&{A Nz (ti-)) 

< A E 7r ^(0 / A Nl )( z )^(0,A Afl )(« ; ) 

z,a;G^(A N2 (0) 

_ _ 7% (0 ,a )(^(0,A N2 (7/ + )))7r mA x^A^Ciy-))) 
+ E E ! — = (A + 1)/eh(J7)|J7I 

/c:(a)6f(^ (A Nl )) I7 6r(^ ;? (A Nl )) n^ k (A Nl ){*l )^M j0 (A Nl ){V ) 

rjB(i,k) 

< A7t^ (0/Ani) (^(0,A N2 (0)) 2 

+ Cl log(N) -maxdeg^ (ANi) (i) • (A + l) • (max 7% (0,^(^(0, A N2 {1)) 2 ) • \£(^ jo (A Nl ))\. 

(2.88) 
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The following all hold with probability at least 1 — c ' e c lo s N : 

• By construction, we have Diam(^(0, A Nz )) < log 155 N for all Ajv 2 G Vn 2 - 

• By Lemma 2.10, Deg(tff) < log 2 N. 

• By Lemma 2.12 




\£ (^(0,A Nl ))| > ^(^(0,A Nl ))| > \p 2 N( 




(2.89) 



Finally, applying Theorem 5' of [26] to the flow / completes the lemma. 



□ 



Theorem 2.1. The volume bounds are established by Lemmas 2.6 and 2.12. The bound 
on the diameter follows by the facts that on the event R each A^ 2 G Vn 2 has diameter at 
most log 155 N and by the proof of Lemma 2.7. Finally Lemma 2.15 establishes the bound 



This theorem and the accompanying construction gives us a method of partitioning 
the largest component of & into connected sets which are approximately blocks but re- 
spect the graphs stucture and have good volume, diameter and spectral gap bounds. 
Moreoever, taking N = N\ establishes Theorems 2 and 3. 



In this section we prove heat kernel upper bounds on the basis of certain a priori 
hyptheses. At this point we leave the setting which motivates this paper, LRP. The goal is 
to formulate conditions on a graph G = (V, £ ) under which we may prove on-diagonal 
heat kernel upper bounds via the technology that appears in [1], see also the references 
there in. For convenience, we will work only with the continuous time SRW. Standard 
coupling arguments may be employed to obtain discrete time heat kernel bounds as well 
(see, for example, the appendix to [6]). 

Let us fix a (possibly infinite) connected graph Q = (V, £). Let ji(x) = deg(x) denote 
the (possibly non-normalizable) measure on Q which weights each vertex by its degree. 
Let C denote the generator of the continuous time simple random walk on Q, normalized 
to have unit holding times: 



on the spectral gap. 



□ 



3. An Abstract Continuous Time Heat Kernel Estimate 



my) = E 



deg(y) 



(/(z)-/Q/)) 



(3.1) 



HEAT KERNEL UPPER BOUNDS ON LONG RANGE PERCOLATION CLUSTERS 23 

Then £ is a self adjoint operator with respect to the Hilbert space L 2 (V,u) and 

(f>£f)r = ~\ E l<y,z>eW/( z ) -/(y)) 2 (3-2) 

Z y,zeV 

Let Pf(x,y) denote the corresponding transition kernel. 

For any finite connected subgraph H C Q and any vertex x G H let deg w (x) denote 
the degree of x within H. Let 

v w (x)=d egw (ac)/^(H) (3.3) 

withZ(W) = £ yGW deg w (y). 

Assume that for each s G [Ti, T2], there exists a distinguished connected subset B(s) 
and a partition V s of B(s) into connected sets {H : H € V s } and a pair of postive func- 
tions A S/ V s on R+ (we assume A s decreases and V s increases) and a family of universal 
constants {c,-, Q}| =1 so that the following assumptions hold: 

(1) For all H EV S , the spectral gap Gap w of the restriction of £ to H satisfies 

Gap w > A s . (3.4) 

(2) For all H G V s , the volume of H satisfies 

Cl V s < Vol(H) < CiV s . (3.5) 

(3) Suppose that for some 7, 8\ > 0, the function A s is linked to V s by the condition 

c 1 V-no ?) - k V s < A s . (3.6) 

(4) Further, let A-p s = min zGB( - s ) d ^|^ and suppose that there exist constants C3, ^ > 
so that 

txp, >c 3 log- j2 y s . (3.7) 

(5) Next let us suppose that there exists a subset Br C B(s) so that 

supP s (x,B(s) c ) <C 3 ^§^. (3.8) 

(6) Suppose that there exist C4 > so that 

2 + C 3 <#,V S log" 1 y s <C 4 . (3.9) 
Then we have the following abstract heat kernel bound: 

Lemma 3.1 Under the assumptions (l)-(6), consider 

ip t = P 2t (x,x)/deg(x) (3.10) 

for t G [Ti/2, T 2 /2] and x G Br. 

Let 5 = 2 + Si + 8~2 + 7. Tfen f/zere exz'si Cs, C6 > (depending only on 7 and t/ze constants 

C\, C\, C2, C3, C 3/ C 4/ C4), 

Ipt(x) < lp T , (X) A C5— ; ; , --r-n (3.11) 

rtv ; - ^Jxv ; ^ | log(l + C 6 (t — T 1 /2))|^/'r v 7 



24 N. CRAWFORD AND A. SLY 

when t e [Ti/2,T 2 /2]. 

Proof. Our various hypotheses will be explained over the course of the proof. Following 
Barlow, let / f (y ) = Then 

<pt=EfhyMy) = (ft,ft)j, (3.12) 

Differentiating, 

Ms = - E l<y,z>6£(G)(/s(z) -fs(y)) 2 - (3-13) 

y,z€V 

It is convenient to work with —d s xp s . By assumption, 

-Ms> E E l< y ,z)^(G)(/s(z)-/ s (y)) 2 . (3.14) 
HeV s y,z€H 

Since 

E i^^^W-M 2 )) 2 ( 3 - 15 ) 

is the Dirichlet form of / s for SRW on H, assumption (1) implies 

E \y,z)^(fs(y) - fs{z)f > A s Var w (/ s ) (3.16) 

y,zeH 

where 

Var w (/ S ) := E deg w (y)(/ s (y) - E w (/ S )) 2 . (3.17) 

yen 

and 

e w (/ s )= LMy)fs(y). (3.18) 

Thus 

-9 S ^ S > A s ^ E deg w (y)/ s 2 (y) - y^7v(E deg w (y)/ s ) 2 . (3.19) 

Since Eydeg (y)/ s (y) < 1, 

E Z(H)~ l ( E deg w (y)/ s (y)) 2 < maxZ(W)- 1 (3.20) 
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and 

^ E L de sn(y)fhy)-^(L de sn(y)fs) 2 

> A s E E deg w (y)/ s 2 (y) - A s max Z{H)- 1 
>A V M E deg(y)/ s 2 (y)-A Ps 1 maxZ(W)- 1 ) 

y€B(s) HGPs 

= ApA s (^-A Ps 1 maxZ(W)- 1 )- A v \ s I E deg(y)/ 2 (y) I (3.21) 

~ rtGFs \yGB( S ) c / 

Using reversibility and the fact that probabilities are bounded by 1, assumptions (2) 
and (5) imply that 

-3.*. > A V A S (fc - (1 + C ^°g ys ) (3.22) 

This explains some of our hypotheses: by assumption (6), tp s V s / log y s > C3 + 2. Ac- 
cording to hypotheses (3) and (4) 

-d s tp s > c'xp s V s -y log 2 ^^ 2 V s . (3.23) 

Under the further hypothesis ip s V s / log V s < C$, we thus obtain 

-d s xp s >c"tpl +rr /\\ogtp s \ s (3.24) 

where 5 = 2 + 5\ + ^2 + 7- 

Using the change of variables u s = 

d s u s >c"'{ 7 )/\og s u s (3.25) 

where we note u s > 1 (and increasing) and c"'(y) > is a constant depending only on 
the exponent 7. Integration by parts gives 

u s log* u s > c"'(j)s + u log 3 u + c"'(j)S f log^ 1 u s 

Jo 

>c'"(j)s + u log s u (3.26) 

since u s > 1 Vs. 

As f(u) = u log' u increases for u > 1, solving the equation 

u log' m = c"'s (3.27) 

for m gives us a lower bound on u s . Letting g(s) = c"'s/ log (c"'s) gives 

"s > g(s) (3.28) 

by direct calculation. 

Recalling that u s = tps~ 7 , the claim follows by appropriate choice of the constant C5 
since xp s is uniformly bounded by 1. □ 
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4. Proof of Theorem 1 

To apply Lemma 3.1 we need to check that the assumptions of that lemma hold. As- 
sumptions (l)-(4) will all follow by Theorem 2.1. Assumption (6) is a more of a cali- 
bration condition for our choice of scale V s than a stringent requirement. Thus the only 
assumption that needs further proof is (5), which will be given next. Finally, in the subse- 
quent subsection, we will gather all these results together to obtain the proof of Theorem 
1 

4.1 Estimates on the Growth of the Walk. Requirement (5) is most conveniently derived 
using the "the environment seen from the particle". Let Xf denote the random walk 
trajectory generated by P w (x,y). Recall the shift operation, r x from Section 2. By our 
assumption of translation invariance of the p X/ y, u is clearly translation invariant for 
all the shifts. The Kolmogorov — 1 law implies that u is ergodic with respect to the 
collection of shifts {T x } xGZ d. Given an initial environment to, Txf '■ CI — > Q defines a 
stochastic map on the space of environments. Let to t := Txf (to), with initial environment 
too — u>. h is clear that to t is Markov, since the underlying random walk is. 

Further, given an environment to, let d u (0) denote the degree of to at 0. Let dP(o;) = 
E ^J£^ du(to) and let us introduce the Hilbert space L 2 (P) = {/ : Q -» K : Ep(/ 2 ) < 

oo}, with inner product (f,g) := / dP(to)f(to)g(to). Note here that since s G (d,oo), 
E F [^(0)]<oo. 

It follows that the operator A t f(to) := f(to t ) is self adjoint since the underlying walk 
is reversible with respect to the un-normalized measure d w (x). Let Q(to,dto') denote 
the transition kernel for co t going from to to to' and let D t = maxo< I( <f |X„| denote the 
diameter of the walk at time t. 

Lemma 4.1 Let l/(s — d) < p. Then for either the discrete or continuous time process, there 
exists a constant c so that for any x G Z d , 

Pf (r> t > ctP +1 infinitely often ) = (4.1) 

u a.s. Moreover, there exist constants C\, 02,03 > such that for any T,A,p,r > with p as 
above and r < s — d, 

P({P a '(3f < T : |D f | > CiTP +1 ) > c 2 /T A }) < c 3 T A+1 -P r . (4.2) 
Proof. Discrete Time Case: Let us consider the increment of the walk at time n: 

\ n = \X„ — X n _i|. (4.3) 

Obviously 

D n <t,Ij (4.4) 

7=1 

Now Ii G I/(P) for any r < s — d. Let us begin by noting that by stationarity of the 
"environment viewed from the particle" process 

P(3t < n : I t > nP) < n 1 ^ '. (4.5) 
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But 

P(3f < n : I t > n?) = E F (Pg(3t < n : I t > n?)) (4.6) 

so we conclude that 

J2 p o( 3t < 2k ■ It > 2kp ) (4- 7 ) 
fc=i 

is summable P, and hence }i a.s. as long as pr > 1. 

Moreover, if we require quantitative bounds, these can be achieved using Markov's 
inequality: 

F({Pg(3t <n:I t >nP)> l/n A }) < n A+1 -P r (4.8) 
The continuous time case now follows easily by a coupling argument. □ 



4.2 The Upper Bound. The upper bound in Theorem 1 now follows by combining 
Corollary 2.2, Lemma 4.1 and Lemma 3.1. 

For the sake of completeness we will be explicit. Consider the event {x G C°°(a;)}. 
Hypothesis (6) of Lemma 3.1 sets the scale for our calculations. It is a general fact that, 
for any infinite connected graph Q, P t (z, z) < c/y/i [10] for some constant c independent 
of z G g. Thus if V t satisfies (6), then V t >c'y/i/ log t. 

Let k, r\ > be fixed. By Lemma 4.1, we can find p, r and a random variable T x (cv) so 
that 

P?(3u < t : ||D f || 2 > ci^) < c 2 /t A,{s -^ +K (4.9) 
for all t > f x (to) and moreover 

p(f x (co) > k) < C{y],K)k-^. (4.10) 

Choose e = 1/ (4p + 3d). By Corollary 2.2 and the translation invariance of ji, we can 
find a positive random variable T* (co) > so that the event 

T x (B(e,[2citH)) (4.11) 

holds for all t > T* £ with 

]i{T Xie >k)< C{£)e- C ^ l0 ^ k . (4.12) 

Finally, by Lemma 2.10 choosing 5 = r\ + 1, we can find a random variable T x *(co) so 
that for all t > T**(co), Deg(B x (2c 1 tP)) < 2ciplogf and 

}i(T**(cv) > k) < C(j/)fc - ' (4.13) 

Let T x (co) = max(T x (u>), T* e (to), T**(tu)) In the notation of Lemma 3.1, for each t > 
Tr(o;),let 

B(t) :=B x (2 Cl tP) 



28 



N. CRAWFORD AND A. SLY 



If t > T x (co) then by our choices above, we may take & t to be the minimal cover of 
B x (2citv) by boxes of side length \_V} ,A \ and 

V t :={Sf(0,A) : AG ff t } 
7 :=(s — d)/d 

Thus, for any t > T x (co), our choices verify Hypotheses (1), (2), (3), (4), (5) of Lemma 
3.1 with 5\ = 52 from Theorem 2.1 and 52 = 1. □ 
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